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Abstract 

In this paper by calculating carefully the capacities (defined by high order Sobolev 
norms on the Wiener space) for some functions of Brownian motion, we show that the 
dyadic approximations of the sample paths of the Brownian motion converge in the 
p-variation distance to the Brownian motion except for a slim set (i.e. except for a zero 
subset with respect to the capacity on the Wiener space of any order). This presents 
a way for studying quasi-sure properties of Wiener functionals by means of the rough 
path analysis. 

1 Introduction 

It has been suggested that the recent theory of rough paths, put forward in T. Lyons [36], and 
developed further over the past years by T. Lyons with his coauthors, and other authors, see 
[35] . should simplify and strengthen the results in quasi-sure analysis over the Wiener space, 
which was initiated by P. Malliavin (see [ID] for example). In fact, the rough path analysis 
has direct applications in solving stochastic differential equations quasi-surely (see below 
for a precise meaning). What however is missing in literature is an approximation theorem 
towards Brownian motion sample paths by simple random curves in p- variation distance and 
in quasi-sure sense, i.e. except for a slim subset, instead of a probability zero set, see below for 
a definition of slim subsets in Wiener space. The main goal of the paper is to establish such 
a theorem (see Theorem 12.31 and Theorem 12.41 below). The results obtained in the paper 
allow to study quasi-sure properties for important Wiener functionals - solutions of Ito's 
stochastic differential equations. The main step in our proof is the construction of quasi- 
surely defined geometric rough paths associated with Brownian motion, which we believe 
has other applications, though not discussed in the present paper. It is known that there 
is an equivalence of capacity zero sets (defined in terms of Dirichlet norms of the Ornstein- 
Uhlenbeck process on the Wiener space) and the polar sets defined by the Brownian sheets. 
Therefore, with the quasi-surely defined geometric Brownian motion paths we constructed, 
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it is possible to define the rough path analysis for the Brownian sheet, thus provides another 
possible route to study stochastic partial differential equations via the rough path analysis, 
this potential application however is not pursed further here. There is increasing interest in 
applying rough path analysis to the study of stochastic partial differential equations, such 
as the recent papers [2J, [3], [5], [IS], [2D] and the references therein. 

In order to address the question we investigate in this perspective, we begin with some 
elements in the analysis of rough paths, and establish the notions and notations which will 
be used throughout the paper. 



1.1 Concept of rough paths 

Let (R d )® fc = R d £g> ■ ■ • cg)R d be the tensor product of fc-folds of the Euclidean space R d ; which 
may be identified with R fca! ; equipped with the corresponding Euclidean norm. For a positive 
integer n, T^(R d ) denotes the truncated tensor algebra 



®(R d )® fc , 

fc=0 

where (R d ) 00 = R 1 . 

Given T > 0, A = {(s,t) : < s <t <T}. T > will be fixed but arbitrary, so it will 
be assumed to be 1 without lose of generality. 

A continuous path w : [0,T] R d is said to have finite total variation on [0, T], if 



sup \w tl - w tl A < oo 
D i 

where sup^, takes over all finite partitions of [0, T\. 

D = {0 = t < • ■ ■ < t m = T}. 

This convention applies to similar situations below without further qualification. 

Let f2°°(R d ) denote the totality of all continuous paths in R d with finite total variations 
on [0,T]. If w E fi°°(R d ), the k-th iterated path integral over [s,t] 

w k s t = J dw tl ® • • • <g> dw tk . 

S<tl<---<t fe <t 

By definition, w\ t = w t — w s is the increment of the path w over [s, t], and for k > 2 

jfc-i 

w k st = lim w 3 s t, , ® t, 



are defined inductively. Collecting all k-th iterated integrals (up to degree n) together we 
define L n (w) : A ->■ T^ n \R d ) by 

L n (w) Sjt = (1, L n (w)l t , ■■■ , L n (w)$ t ), L n (wt t = V(s, t) 6 A 
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Ln(w) 1 is called the first level path of L n (w) which indeed recovers the original path through 
w t = wo + L n (w)l t up to the starting point. L n {w) 2 is called the second level path etc. Often 
L n (w) is written as w if no confusion is possible. 

L n (w) satisfies an important equation, called Chen's identity 

L n (w) St r ® L n (w) r>t = L n (w) Stt VO < s < r < t, 

where the tensor product takes place in the truncated tensor algebra T^ n \M. d ). It is indeed 
the reason that the zeroth term is taken as 1 in the definition of L n (w). Chen's identity is 
nothing but represents the additivity of iterated integrals over disjoint intervals. 
Let f2°°' n (M d ) denote the totality of all functions L n (w) where w runs through 

Q^ n (R d ) = L n (fi 00 ^)) = {L n (w) : w G 

which may be naturally identified with the space of all w G started from [or any 

fixed point in M. d ). 

Next step is to equip f2°°' n (]R d ) with a metric, and introduce the concept of geometric 
rough paths in R d . Let p > 1 be fixed and [p] denote the integer part of p, which relates to 
the roughness of sample paths. The interesting values of p are real numbers between 2 and 
3 for the study of Brownian motion in M. d . The p-variation metric, which is the key concept 
in the analysis of rough paths, denoted by d p , is a metric on defined by 



d p (L(v),L(w)) = max sup ( V \L(v) k t - L(w] 

l<A:<[p] D < A — ' 1 



where L(w) denotes L^{w) for simplicity. The completion of f2°°'[ p ](M d ) under d p is denoted 
by GQ p (R d ). An element in GQ p (M. d ) is called a geometric rough path in M. d of roughness p. 

If w = (1,1V 1 , ■ ■ ■ ,w^) E GQpiW 1 ), then w satisfies Chen's identity w StT . <g> w r ^ t = w Syt 
in T^(R d ) (for any 0<s<r<t<T), and w has finite p- variation in the sense that 



T. Lyons [36 



p/k 



< oo for all k < [p]. 
has demonstrated that a theory of integration for a geometric rough path 
may be established. Let w 6 GQ p (R d ) and / : R d -> L(R d , R d ) a function on M. d with values 
in the linear space L(M d ,IR d ) of all linear operators from M d to M d , where L(IR d ,R a! ) may be 
identified with M. d ® M. d or the Euclidean space M. dd . Such an / is called an IR d - valued 1-form 
on M d . Let f k denote the k — 1-th derivative D k ~ 1 f of / which is identified with a function 
on R d valued in L((R d )® k ,R d ) (where k = 1, • • - ,). In particular f 1 = f . 

If w G f2°°(lR d ), then y = (yt)te[o,T\ ^ fi°°(M d ) where y t = f f(w s )dw s is the path integral 
defined via the Riemann integral 

yt= l ^ ln ^2f( w ti-i)( w k- w ti-i)- ( L1 ) 

m(D)iO 

One of the main results in the rough path analysis is the following continuity theorem. 
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Theorem 1.1 (T. Lyons [36]) Suppose that f G C™ +1 (R d ; L(R d ,R d )), where n > 1 is an 
integer, and suppose that p > 1 such that [p] < n. Then the integration, which takes 
L(w) G n°°'W(R d ) to the lifting L(y) G fi°°<[ p l(M J ) of y defined by / TO) , is continuous 
with respect to the p-variation metrices. Moreover, the mapping L(w) — > L(y) is uniformly 
continuous on any bounded set of Q,°°'^(R d ). 

If w G f2°°(]R d ) and its lifting to a geometric rough path w = (1, w 1 , • - • ,w^), yt = 
f f{w s )dw s the usual Riemannian integral defined as above, then y G fi°°(IR a! ) (which is 
true actually for a Lipschitz continuous /). The lifting y = L^(y) is denoted by f f{w)dw. 
The previous theorem says that the Ito-Lyons integration w — )■ J f(w)dw is continuous with 
respect to the p-variation metrices. Notice that the usual path integral w — > J f(w s )dw s is 
in general not continuous under the uniform norm of paths. 

As a consequence, for w G Gtt p (R d ) and / G C^ +1 (M. d ; L(R d , R d )) we can definite its 
integral J f(w)dw as a unique geometric rough path in Gfl p (R ). It is however interesting 
to know how to define J f(w)dw directly by means of rough paths. 

Let us describe the definition for p G (2,3) which is the most interesting case as it is the 
case for geometric rough paths associated with Brownian motion. 

Since [p] = 2, so that we need to define two components y 1 , y 2 which defines a rough 
path / f(w)dw = (l,y\y 2 ) where w = (l,w\w 2 ) G Gfi p (R d ) and / G C^(R d ; L(R d , R d )). 
To this end first define y = (1, y 1 , y 2 ) by 

and 

It is understandable that y is not a geometric rough path yet (for example, it does not 
satisfy Chen's identity in general), so we take a limiting procedure to define y = (l,?/ 1 ,?/ 2 ) 
by means of Riemann sums but at the tensor level. More precisely define 

m(D)lO 

where the tensor product ® takes place in the truncated algebra T 2 (R d ), and the limit 
lim m (£))jx> takes over finite partitions of [s,t\. We then can show that the above limit exists 
and y = J f(w)dw. 

1.2 Differential equations driven by rough paths 

The most important result in the rough path analysis is the universal limit theorem for 
solutions of differential equations. Let R d and R d be two Euclidean spaces. Consider a 
system of differential equations of the following form 

d 

dyl = Y,fl(yt)dw\ (1.2) 

i=i 



4 



with initial data yo G M d , w = (w l ) is a continuous path in R d , and / = (//) : IR d — > 
L(M. d ,M. d ) is a function, called an M d -valued vector field on R. d , where for y G W 1 , £ G R d , 
(f(y)Cy = Yli=i fi(v)C- Suppose fl are Lipschitz continuous, and w G fi°°(R a! ), then the 
standard Picard iteration applying to the integral equation 

yi = yi + J2 f fiiys)d< 
i=i J ° 

allows to determine a unique continuous path y in M. d with total finite variation. This 
establishes a mapping sending w G to the solution y G f2°°(lR d ), denoted by y = 

F(y , w). We then lift both paths w and y to their corresponding geometric rough paths of 
the same roughness p, which thus defines a mapping which maps L^Aw) to L^(y) for each 
p > 1. We will again denote it as F(y , •). That is 

F(y ,L n (w)) = L n (F(y ,w)) Vu; G Q^ n (R d ). 

Theorem 1.2 (T. Lyons JSU/)Let p > 1 and f = (/)) G C^ +1 (1R^; L(IR d , R d ")). Then 

F(yo, •) : ^[ P ](w) L M (F(y , w)) 

zs continuous from f2°°'^(IR d ) to fi°°'^(]R d ) mi/i respect to the corresponding p-variation 
distances. 

This theorem ensures that there is a unique continuous extension of F on G ? r2 p (IR a! ), still 
denoted by F(y , ■), so that F(y ,w) = Lu (F(y , tt(w))) if w E GQ p {R d ). The continuous 
mapping 

to G Gfi p (M d ) -»■ F(y ,w) G Gfi p (M J ) 
is called the Ito-Lyons mapping defined by the differential equation (11.21) . 

2 Main results 

The analysis of rough paths, developed in T. Lyons [36], [35], can be applied to the study of 
stochastic differential equations with driven noises which are far more irregular than those of 
sample paths of semimartingales. On the other hand, Lyons' theory also sheds new insight 
on Ito's classical theory of stochastic differential equations, namely, stochastic differential 
equations driven by Brownian motion, as presented in [23] for example. 

In order to apply Lyons' rough path theory to Ito's theory of stochastic differential 
equations, it is necessary to enhance Brownian motion sample paths into geometric rough 
paths. The first construction of Brownian motion as rough paths was presented in E.-M. 
Sipilainen's Ph.D. thesis (1993) at University of Edinburgh, under the supervision of T. 
Lyons. B. Hambly and T. Lyons provide further examples in [!1| and [21]. They proved 
that symmetric diffusions with generators of elliptic differential operators of second order, 
and Brownian motion on the Sierpinski gasket can be enhanced into geometric rough paths 
of level two. In [6], geometric rough paths of level 3 associated with fractional Brownian 
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motions with Hurst parameter h > | were constructed by means of dyadic approximations, 
and in [35] further examples of geometric rough paths of level 2 or 3 associated a class 
of Gaussian processes and more generally a class of stochastic processes with long time 
memory are given. In particular, T. Lyons and Z. Qian [35J showed that if the correlation 
of a continuous stochastic process over disjoint time intervals satisfies a polynomial decay 
condition, together with further less important technical conditions, then the rough path 
analysis may be applied to stochastic differential equations driven by such a process. We 
notice that the correlation decay condition is a generalization of the martingale property 
which is the key in Ito's theory of stochastic calculus. We would like to recommend the 
reader for other constructions of rough paths to the books by T. Lyons, M. Caruana, and T. 
Levy [37], P. Friz and N. Victoir [12], A. Lejay's survey [32], and other papers listed in the 
references in [35] and |12j . 

In particular, the following theorem has been proved (for a proof see for example [35]). 

Theorem 2.1 If B is a Brownian motion in M d on a complete probability space (fl, J 7 , P), 
B = (1, B\ B 2 ) where B l s t = B t - B s and 

B\ t = [ odB tl ®odB t2 (2.1) 

for < s < t, defined in the sense of Stratonovich's integration, then, for any 2 < p < 3 

F{oo : B(u) G Gfi p (M d )} = 1. 

The goal of the present article is to prove a stronger result that B are geometric rough 
paths quasi-surely. To make it more precise, we need more notions and notations on Malliavin 
calculus, the capacity theory on the Wiener space. 

Let W denote the space of all continuous paths in K. d started at the origin, endowed 
with the topology of uniform convergence over finite time intervals. If we wish to emphasize 
the dimension d we will render our notations with superscript d. B(W) denotes the Borel 
cr-algebra, which can be described in terms of the coordinate process (B t ) t > on W, where 
for each t > 0, B t is the coordinate functional on W at time t. That is to say, if w G W, 
then B t (w) = w(t). In what follows, in order to avoid heavy notations, B t may be written 
as w(t) for w G W, Wt(w) or w(t,w) if no confusion may arisen. Similar convention applies 
to w as well: w may be considered as a typical path in ~R d or as the canonical coordinate 
process on W. Let J-" t ° to be the smallest cr-algebra over W such that all B s for all s < t 
are J^-measurable. Then J 70 = a{B t : t > 0} coincides with B(W). 

The Wiener measure P is the unique probability on (W, B(W)) which makes the coordi- 
nate process {B t )t>o a standard Brownian motion. Alternatively, P is the Gaussian measure 
on (W,B(W)) with the characteristic function 

I e i{l '- } dF = e-5l'l« for I G W* 
Jw 

where W* is the dual space of W, identified with a linear subspace of the Cameron-Martin 
space 

H = {hE L 2 (R + ,R d )\h(0) = and h G L 2 (M+,M' i )} 
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equipped with the Hilbert norm \h\n — y J °° \h(t)\ 2 dt. It is clear that any h G H has a 
continuous representation and there is a natural continuous imbedding H ■=->■ W. 

If h G H then £ fc denotes the Wiener functional on W defined by Ito's integration £ h = 
J °° h.dw. Then, ^ h has a normal distribution N(0, The translation : w — > w + h is 

measurable, and P o th is equivalent to the Wiener measure P, which allows us to define the 
Malliavin derivative of £ z in a direction h: 

^ <>oo 

^ / l i d{w i + etf) = (I, h) H , 
£ =o i=1 Jo 

Define _D£ Z to be H- valued random variable on W by requiring that {D^, h)n = Dh^i for 
all h G H so that _D<^ = / for any / G H. 

If / is a smooth Schwartz function on M. n , and /i, • • • , Z n G J?, then F = f(^ tl , • • • , ^ ) 
is called a smooth Wiener functional on The collection of all such smooth functionals 
is denoted by S. By forcing the chain rule to define the Malliavin derivative by 

3=1 

By iterating the definition we may define 

which is an i? <gl2 -valued random variable. We can define D S F inductively, see j2l] for more 
details. 

The Sobolev norms ||F||g,jv (f° r N = 0, 1, 2, • • •) is equivalent to Ylj<N \ \D^F\\p, where 

||^F|| g =^E{|^F|^}. 

The completion of smooth Wiener functionals under the norm 1 1 ■ | is denoted by 3 q N . For 
any q > 1 and integer N > 0, (JB> 9 N , \ \ ■ || g ,jv) is a Banach space, called a Sobolev space over 
the Wiener space W. The (g, iV)-capacity Cap^N is a function on the collection of subsets 
of W defined as the following. 

If A is an open subset of W, then 

Cap N (A) = inf {||w|| 9 ,jv : u G B> q N s.t. w > 1 P-a.e. on A and u > a.s. on 

and for a general subset A C W 

Cap qN (A) = inf {Cap qN (B) : 5 open and B D A} . 

For each pair (g, iV), Cap q ^ is a capacity on in the sense that the following hold. 

1. <Cap q<N (A) <Cap q , N (B) HACB, 
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2. Cap q ^ is sub-additive, that is Cap q ^(yy^ x Ai) < ^^Cop^jvCA)- 

If A is Borel measurable, then F(A) <Cap q ^{A) q for any q > 1 and N > 0. Therefore 
any subset of W with (q, iV)-capacity zero is null set with respect to the Wiener measure 
P. However there are many probability zero subsets which have positive (g, Incapacity. 
According to P. Malliavin, a subset A C W is called slim, if Cap q ^{A) = for all g > 1 
and N eN. 

We are interested in the random field B(w) = (1, w 1 , w 2 ) on W, valued in T 2 (R d ) 
parameterized by {(s,t) : s < t}, where 

wl )t = wt-w s , 

w 2 st = / odw tl <g) odw tl 

Js<ti<t 2 <t 

for all s < t, where od indicates the Stratonovich integration. 

We are now in a position to state an interesting consequence to our main result Theorem 

Theorem 2.2 For any p G (2,3) there is a version of B such that {B ^ Gf2 p (IR d )} is slim, 
that is 

Cap qN {weW : B(w) £ Gtt p (R d )} = 
for any q > 1 and N G N. 

It is well known and indeed it is very easy to show that there is a version of the stochastic 
integrals B 2 (defined by (12. ip ) so that B is well defined on W except for a slim subset (i.e. 
a subset with (q, iV)-capacity zero for all q and all positive integer N), and B is a rough 
path quasi-surely, that is, B(w) has finite p- variation and satisfies Chen's equation for all 
w G W except for a slim subset. Since 2 < p < 3, such a version of B allows us to develop a 
theory of stochastic differential equations and thus gives quasi-surely defined solutions for all 
stochastic differential equations with coefficients which are regular enough. However, such a 
theory will not ensure a convergence theorem such as Theorem 12.41 In fact, we will prove a 
quasi-sure approximation theorem for the Brownian motion. 

For a given natural number n, k = 0, 1, • • • , 2 n , t^ — k/2 n are the dyadic points in [0, 1]. 
For a continuous path w G W, w^ n ' is the polygonal approximation defined by 

wl n) = + 2»(f - t k - l ){w tkn - w t ^) for t G [t k n ~\t k n ] (2.2) 

for k — 1, • • • , 2 n . This notation equally applies to the coordinate process {w t : t G [0, 1]}. 

The idea of approximating Brownian motion by piece-wise smooth sample paths origi- 
nated from the fundamental research of P. Levy [33], [34] . also see K. Ito and H. P. McKean 
[26] for the construction of Brownian motion sample paths starting from polygonal paths 
with vortices modelled by random walks. 

For simplicity, if no confusion is possible, we will write u/ m ) for L 2 (w^), the enhanced 
geometric rough path of level two which is associated to w^ m \ Our main result may be 
stated as the following 
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Theorem 2.3 For any p G (2, 3) 

Ca PqjN l^weW : ^d p (™ M V m+1) ) = ooj = (2.3) 

for any q > 1 and JVgN. 

It is obvious that ( 12. 3 p implies that for any p between 2 and 3 there is a subset A C 
such that Cap q ^{A) = for all q > 1, iV G N, and wr" 1 - 1 converges in Gf2 p (IR d ) to a limit it? 
on W \ A, which is a modification of B. 

Putting together with Lyons' universal limit theorem, we obtain immediately the follow- 
ing quasi-sure limit theorem. 

Theorem 2.4 Consider the Stratonovich's type stochastic differential equations on the Wiener 
space (W, B(W), P) (so that the coordinate process w = (wt)t>o is a standard Brownian mo- 
tion) 

d 

dyt = /*(*") ° dw t + Mvt)dt (2.4) 
i=i 

with initial data y , /, = (//) (i = 0, • • • , d; j = 1, • • • ,N). Suppose f- and p are in 
Cfr(M. N ). Suppose for each m, y^ be the unique solution to the ordinary equation 

d 

dy t = Y fi(yt) dw t m) ' 1 + fo(yt)dt. 
i=i 

Then for any p G (2, 3) there is a slim subset A C W which is independent of \2.$ such 
that = L 2 (y( m ^) converges to y = (1, y 1 , y 2 ) on W \ A, and y t = y + y\ t is a version 
of the unique strong solution to \2.J$ . 

This kind of limit theorems for stochastic differential equations via ordinary differential 
equations in the context of almost sure sense have been discussed by E. McShane [42], E. 
Wong and M. Zakai [50], D. Stroock and S.R.S. Varadhan [47], and etc., see Section 7, 
Chapter VI in N. Ikeda and S. Watanabe [24] for a definite form and for further reference 
therein. By using Lyons' universal limit theorem, the Wong-Zakai type limit theorem has 
been extended to other rough differential equation driven by symmetric diffusions in [21J, 
[2T] , by fractional Brownian motions in [6] and by other Gaussian processes in [35] , [12] , and 
A. M. Davie [7J, [8] and etc. 

In the context of quasi-sure analysis, partial results (i.e. for a solution to a single dif- 
ferential equation or special Wiener functionals) have been obtained by T. Kazumi [2H], Z. 
Huang and J. G. Ren [23], J. G. Ren [44], P. Malliavin and D. Nualart [H], S. Fang [TO]. 
Our result is a natural generalization of the preceding mentioned results, and our negligible 
set is universal which is independent of the concerned Wiener functionals. 

The capacity theory on the infinite dimensional space W was first studied by P. Malliavin 
[5§] . In fact, Malliavin introduced the concept of slim sets as negligible sets - those subsets 
of W with (q, iV)-capacity zero for all q > 1 and positive integer N, by using the Ornstein- 
Uhlenbeck process on the Wiener space W. The current definition of Cap q ^ was gradually 
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developed through a series of work by I. Shigekawa [IB], H. Sugita [IB], M. Fukushima 
[13], [IB], PU, US], H. Kaneko [27], M. Takeda 03], J. G. Ren [15], F. Hirsch and S. Song 
[22] . see P. Malliavin [10] and K. ltd [25J for systematic expositions on slim sets, and M. 
Fukushima [14J, Z.M. Ma and M. Rockner [3B] and N. Bouleau and F. Hirsch [3] for the 
capacity theory defined via analytic or probabilistic potential theory. 

Many important almost sure properties which hold for Brownian motion were proved for 
the corresponding quasi-sure versions by D. Williams (see the article by P. Meyer [13]), M. 
Fukushima [IB] . M. Takeda [19], S. Fang [11] etc. S. Kusuoka [29], [30J initiated the study 
of non-linear analysis on abstract Wiener spaces by using capacity theory. 

The starting point in our approach is the capacity version of the Borel Cantelli lemma, 
that is, if YZ,=i Ca Pi,s{ A m) < oo then Cap q>s (lim m _ > . 0O A m ) = 0. 

Let p G (2,3) be fixed, and consider 

A m = jw G W : d p (w (m+1 \w {m) ) > C ( 
for some /3 > and constant C > 0. If we are able to show that 

oo 

Cap q>N (A m ) < oo (2.5) 

m=l 

then Capq^Qimm-^ooAm) = so that 





P X m+1) V m) ) = oo}< Cap^ilimm^Am) 



which yields Theorem 12.31 Therefore, we would like to estimate 

Cap q , N {weW: d p {w {m+1 \ w {m) ) > A} . (2.6) 

There are few techniques available to bound the capacity such as (I2.6P in contrast to 
corresponding almost sure statements. In fact the only effective tool to the knowledge of the 
present authors is the capacity maximal inequality (also called the Tchebycheff inequality, 
see 1.2.5 on page 92 and 2.2 on page 96 in [10]), which says that, if u G B> q N and if u is lower 
semi-continuous or continuous with respect to the capacity Cap q ^ s , then 

C 

Cap N {w e W : u(w) > X} < -^-\\u\\ q , N VA > (2.7) 

A 

where C q ^ is a constant depending only on d, q and N. This requires to estimate the Sobolev 
norms of u. Unfortunately, we are unable to show (and we do not believe it is true) that 
w — >■ eL 

( w (nH-l) >ti ,M) i s 

differentiable in the Malliavin sense. Instead we consider a metric 
over paths which dominates the p-variation distance, but differentiable in Malliavin sense 
and still good enough for Brownian motion. 

In what follows, p G (2, 3) and 7 > | — 1 be fixed. 
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If w = (1, i/; 1 , it; 2 ) and w = (1, w 1 , w 2 ) are two functions on A taking values in T 2 (W d ), 
we consider 

2" 



pj{W,W) 



oo 



n ' 



v n=l fc=l 



, k j.k—1 <k 



J. 

p\P 

3 



{21 



where j — 1 or 2. We will use pj{w) to denote Pj(w, w) with iu = (1, 0, 0). were invented 
and used in B. Hambly and T. Lyons [21] for constructing the stochastic area processes 
associated with Brownian motions on the Sierpinski gasket. These functionals were used in 
M. Ledoux, Z. Qian and T. Zhang [21] to show the large deviation principle for Brownian 
motion under the topology generated by the p-variation distance. The following estimates 
have been contained implicitly in [21] and made explicit in [35] and |31j . 



Lemma 2.5 Suppose 7 > ~ — 1. Then there is a positive constant C depending only on 7, 
d and p such that 



sup \ w Li,ti P ) ^ c Pi( w )> 



(2.9) 



sup 

D 



E 



w 



tj.—i M 



1_ 

p \ p 



< C ( Pl (w) 2 + p 2 (w)) 



where sup D takes over finite partitions D of [0, 1], and 

d p (w : w) < C max {p 1 (w,w),p 2 (w,w),p 1 (w,w) ( Pl (w) + p 1 (w))} 



(2.10) 



(2.1i; 



The idea in our approach is to replace d p by the right-hand side of (12. lip . Therefore, 
instead of considering the capacity of {d p A}, we consider the following 



pJw {m+1 \w im) ] 



(m+l)j „„("») J 



v n=l k=l 



2. 

P \ P 



where j = 1,2 and m = 1,2, 



Let 



( m ) / 
Uj '(w) 



Pj {w {m+1 \w {m) ) P l 



n=l k=l 



Then, clearly, for each N = 1,2, 



(m),N 1 x 



N 2 n 

E» 1 E 

n=l k=l 



(m+l)J _ ,„(m)J 

ifc— 1 



is continuous on W (which is equipped with the uniform norm over [0, 1]), and u^ a \w) = 

supjY Uj' N (w). Therefore U™ is lower semi-continuous on W, and moreover Uj G H>1 for 
any q > 1. Therefore we may apply (12. 7p to deduce that 



Ca PqA {p,(™ (m+1) , w^)l > A} < ^ 



pAw 



(m+l) ^M^f 



9.1 



(2.12) 
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and similarly 

Cap q<1 { Pl (w {m) y Pl (w {m+1 \w {m) ) p > A} 

< ^\\p 1 (w^y Pl {w^ 1 \w^y\\ q>1 (2.i3) 

for any A > 0, where C q > depends only on q and d. In the next section we will establish 
the necessary estimates to ensure (I2.5P for the case that N — 1. 

This approach can not be extended to (q, iV)-capacity case with N > 2, this is because 
of a simple reason that our dominated distance Pj(w^ m+1 \ ii/ m ))l does not belong to ~B q N 
for N > 2. We need new idea to estimate the (q, iV)-capacity for N > 2. The observation 
to get around this difficulty is that the capacity of {pj(w^ m+1 \ w^) p '^ > A} is "evenly" 
distributed over the dyadic partitions, which allows to reduce our task to estimating the 
capacities of some polynomials of Brownian motion sample paths, which will be explained 
in the last sectionof the article, where we conclude the proof of Theorem 12.31 



3 Some technical estimates 



In this section we establish several technical estimates which will be used in the construction 
quasi-surely defined geometric rough paths associated with Brownian motion. 

We are going to use the following notations. If J C [0, oo) is a finite interval, then 1 j 
is the characteristic function of J, and lj G H, which is Revalued function with the same 
component f Q lj(s)ds. Hence \1j\h — V^-^/PT where \ J\ denotes the length of the interval 
J. We will frequently use the following elementary fact: if { Jj : % — 1, • • • , n} is a family of 
disjoint finite intervals, then 



5>. 



i=l 



Vd 



H 



\ 



(3.1) 



1=1 



The corresponding fact for the increments of Brownian motion instead of characteristic 
functions is the context of the following lemma. 



Lemma 3.1 Then there is a constant C > depending only on d, such that 



N 



i=l 



< Cq^N 



(3.2) 



for any which are independent valued in M. d , with the standard normal N(0, 1^), for 

any q > 1 and N G N. 

This lemma follows from a simple application of the hypercontractivity of the O-U semi- 
group. 

The increment over the interval = (^ _1 ,^] of w G W is denoted by £ n (w) or simply 
by £ n (which denotes the functional w — > £ n (w) as well by abusing the notation), if no 
confusion may arise. That is 



S (i 



W±_ — Wk-i . 

2 n 2 n 



k 



(3.3) 
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For each n, since { J% : k — 1, • • • , 2 n } are disjoint, { : k — 1, ■ • • , 2 n } are independent, 
identically distributed with normal distribution iV(0, 2~ n I R d). 

Recall that = /^(w^) with first level component w^ 71 '' 1 and second level uA n )' 2 

respectively, so that 



,(«),2 _ r ,*.» ,,„» (3-4) 



I< tl < <2 <^<® ^fe- 



lt is easy to see that 



m (m),i _ / C for n < m, , , 
W tl-\tl-\ ^ ( "' m) forn>m 

where k = 1, ■ • • , 2 n , and in the case n > m, k(n,m) is the unique integer / between 1 and 
2 m such that 

t m —tn < < t-m- (3-6) 

In order to write down some formulas which will be used in what follows, it is better to 
use Possion bracket operations [, ] and {, }, that is, if £, 77 G M. d , then 

[£,77] = £®rj-r)®£ (3.7) 

and 

{^V} = Z®V + V®^ (3-8) 

while we will reserve the sharp bracket (a, b) to denote the scalar product in the Euclidean 
spaces, or in Hilbert space H® k . 

With these notations, we have (see Section 4.2 in [35] for details) 



for n < m, so that, if n < m, then 



(m),2 _ L k fk 

r,s=2 m ~ n {k-l)+l 
r<s 



2 m ~ n k 

(m+l),2 (m),2 



k — 1 j.^ '^j.k—l ifa 



I E ffi.CJ- ( 3 - 10 ) 



If n > m, then 



r =2 m -"(fe-l)+l 



1 n2m 

4 m -^ = ^^ (n ' m) ®^ (n ' m) - ( 3 - n ) 



3.1 Moment estimates under Sobolev norms 

In this part, let p G (2,3) is a constant, d is the dimension, n, m G N. We wish to develop 
several moment estimates for iir™t*i' 5 ' — w^f ' k (for j = 1, 2). 



13 



Lemma 3.2 There is a constant C depending only on d such that 



w 



fc-i 



< 



C (V^T^y forn<m, 



where j = 1, 2, 



w 



(m+l),l _ ,„(«i), 1 
w f k-i , k 



j-fe 



and 



for any q > 1 . 



(m+l),2 (m),2 



fc — 1 

n 5 r n 



< 



< 



/or n > m 

if n < m, 
C VV\H %fn>m, 

Cq 



(3.12) 



0. 



i 



2" 

2 2n 



ifn<m, 
if n > m. 



(3.13) 



(3.14) 



Proof. For simplicity, let Yj = w \_[ k and Xj 
follows from the fact that 



(m+l),7 



Forj = l, 



l*i| 



^rll^l, forn<m 
for n > m 



(3.15) 



where £ ~ iV(0, Ircj), and the fact that ||£|| 9 < C^fq for some constant C depending only on 
d. For j = 2 and n > m, (13.121) follows from (13.1 ip directly. Consider the case that n < m. 
According to (13. 9p . we need to estimate 



E 



f r (5?) f s 



r,s=2 m -"(fc-l)+l 
r<s 



s-1 



E E 

s=2 m -"(fc-l)+l r=2 m - n (fc-l)+l 



Since £^ (8) £^ belong to the second chaos component for whatever m, so that I q < C\qli for 
some constants C\ and Ci depending only on d, but independent of m or u. Therefore we 
may assume that q = 2. Furthermore, for simplicity, set r] s = ^r=2 m -™(fc-i)+i £m- Then £ s 
has a normal distribution with mean zero and 



Thus 



var(r? s 



E ^ ® 

s=2 m -"(A;-l)+l 



IsSra 



_( s _2"*-»(jfe_i)). 

2 m v v 



E E E 

*iJ=l \ v s=2 m -™(A;-l)+l 
d 2 m ~ n k 

e e E (^r 

i,J=l s=2 m -™(fc-l)+l 
d 2 m -"fc 

+ 2 E E E 

i,i=l s=2 m -"(fe-l)+l 



14 



The last term has contribution zero. This is because for r > s, Cm * s independent of Tf s ^r] l r . 
so that 

E UO''i:a) = E (t&#t£) EC = for a < r. 

Hence 



E ^ S ® e 

=2 m -«(fc-l)+l 



d 2 m " n fc 

E E E (*a<) 2 

i,i=l s=2 m - n (fc-l)+l 
^ 2 Tn ~ n k 

E E e (^e«) j 

2,j=l s=2 m - n (fc-l)+l 

d 2 V -l( s -2 m -"(A;- 1)) — 
/ ^ 2 m 2 m 

s=2 m -™(fc-l)+l 

-^ 2 y s < c4- 

o2m / - > — o2n 



s=l 



and therefore 



2 m-n fc 



E 



r,s=2 m -™(fc-l)+l 



< C2O — 



for some constant C depending only on d. By (13.41) and the preceding estimate we have 



211,, < 2 11^" ® + 2 



2 m-n fc 



IS m 5 S r 



r,s=2 m -™(fc-l)+l 
r<s 



2 m-n k 



E c ® c 



r,s=2 m -"(fc-l)+l 



- 2 n 

Remain to show (13. 14)) for the case n < m. Indeed, by ( 13 . 10)) and (13. 2p 



(m+l),2 _ (m),2 



1 

2 

< Cg 



2 m-n fc 



V ^ 2r_1 f 2r 1 

r =2 m -"(fe-l)+l 



which completes the proof of the lemma. 
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Lemma 3.3 There is a constant C depending only on d such that If n > m, then Y\ 
^Hrhm) DYl = ^l jk(n , m) Ifn < m, then Y x = £ so that DY 1 = l Jh 



J - /w ,k-l , k 

l n i l n 



< 



H 



C\h ifn<m, 



if n > m, 



(j2 m I 1 



2 n V 2 



d ( w^T-^Y - w'™ 1 ; 1 k 



< 



H 



if n < m, 

C \2^V2^ l J n>m 



(3.16) 



(3.17) 



and 



< 



2-a f&m rr lfn>m 



^ "\f y A / o3n \ I e yn-\-m 

for any m,n G N, and k — 1, • • • , 2 n , a — 1, 2. 



(3.18) 



Proof. ( I3.16P is easy to see. Let Xj = w \_ 1 fc — w k Jf k for simplicity. If n < m then 
X\ = 0, and if n > m, then 



DX\ = 1 fc(n,m+l) — —1 Mn,m) 

Z J m+1 Z m 



\DX 1 \„ = Ci 



/ 2 m / 1 




so that 

nv 

lH V 2 n V 2 r 

where C is a constant depending only on d. Next we consider the Levy area X 2 . If n < m 

2 m-n fc 

2 



i=2"»-»(Jfc-l)+l 



so that 



and 



where 



2 m-n fc 



j2l 

"m+1' 



Z=2 r "-"(fc— 1)+1 



D^Xo = — 7 ( [1 r 2!-l, 1 r2i 1 — fl 72i , 1 r 2!-l 

Z 2 ^ V J ™+1 m +! m +! J m+1- 



l=2 m - n (k-l)+l 



[1j21-1,1j2Z }(t 1 ,t 2 ) = [l J 2l-l(t 1 ),lj3l (t 2 )]. 



'm + \ m+1 



m+1 m + 1 
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\DX. 



2\H — 



IE 



Since the intervals J^+i are disjoint, so that 

/ 2 m - n k 
i,j y=2 m -™(fe-l)+l 

zE E ([ij^.^+i]^ 



< 



t2( 



5 z* roii^i.^n'+iKsi.i^ji 2 ) 

7 rim—n/J- 1 \ i 1 



j,j l=2 m - n (k-l)+l 



l=2 m - n (k-l)+l 
2 m - n k 



< V (\£ 21 i 2 + if 2 '- 1 \ 2 ) 

— 2 rn+1 2.^1 U*»m+ll ^ ISm+ll ^ 



«=2 m -"(fe-l) + l 



that is 



Similarly, 



|M 2 |h < 



2m+l 



i 



(iCn+lP + l£m+l| 2 )- 

=2 m -"(fc-l)+l 



OO /-OO 



|L> 2 X 12 



2 1 if® 2 — 




2 m -"k 



./0 



i E E - 1 1 ^' ^ 



+i 




Jo 



OO />00 



< 




'0 JO 

C2 m-n 



i,j y=2 m -™0-l)+l 
2 m -"fc 

j E E" 1 ^ 1 

l=2 m -™(k-l)+l i,j 

2 m-n fc 

E i^-e^-eJ 

l=2 m -"(k-l)+l 
1 



72i 1 — [1 j2l ,1 7 2i-ll I ' 
m+1 ' J m+l J J m + 1 7 J m+l JI 



2 2(m+l) 

where C depends only on d, so that 



|D 2 X. 



2 1 if® 2 



< c 



1 



Hence, for q > 2 we have 

l|M 2 |L < 



2m+i 



\ 



2 m ~ n fc 



E/ (l^m+ll 2 + ICra+ll 2 ) 

«=2 m -«(fc-l)+l 



< 



2m+l 



V 



2™-™fc 



E/ (I l^m+ll I 2 + I ICm+ll I 2 ) 

=2 m -"(fc-l)+l 
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and 



\\d 2 x 2 \l<c 



y / 2 m+n 



Vn < m. 



(3.21) 



If n > m then 

DX 2 



12 



2m 



4{e 



fc(n,m+l) 



so that 



\DX 2 \ H < C 



,-).-):>:, \ MSni-J ' , l^fcKm+l)} ' \ 1 jHn,m) } J . 

12 



2m 



2 2 2n 



I 4{1 ,fc(n,m + l) , 1 ,fc(n,m+l) } — {1 jk(n,m) , 1 ,k(n,m) } I 
V J m + 1 J m + 1 Jm J ™ / 



^ 3m /J_ / |tfe(n,m+l)| _|_ |tfe(n,m) 
23n y 2" I 



\D 2 X 2 \ H < C 



)2m 



2 2n 2 r 



for a constant C depending only on d. After taking g-norm, we obtain (13. 18ft . ■ 

Lemma 3.4 Let N > 0. Consider the following junctionals f mnk on the Wiener space W 
defined by 

2N 



3 

m,n,k 



W 



,(m+l),j 



Uk-X , fc 



for w G VV 



// 2N > 1 i/ien t/iere is a constant C depending only on d and N such that 



l ff ( 2 m 

\ D fL, n ,k\\ q < Cq N ( 



JV 



/or n > m 



and 



\DflnA < 



(y^rj^ ifn<m, 



(3.22) 



(3.23) 



(3.24) 



m . 



for any n, m and k = 1, • • • , 2 n . 



Proof. Let X< 



(m+l),j _ 



(j = 1,2) as in the proof of the previous lemma. 



Since 2N > 1 so that f 3 mnk are differentiable in the sense of Malliavin calculus, and by chain 
rule 



so that 



DfL,n, k = ™\X j \* N -V (X i; M,) 

i2JV— 1 



\Dfl wn , k \ H <2N\X 3 r- 1 \DX 3 \ H . 
Thus, choose a > 1 such that /3 = aq(2N — 1) > 1. Then, by using Holder's inequality 

||^,n,,|| g <2iV||X,||f- 1 ||M J || gQ/ (3.25) 

where ± + ^ = 1. Using fl3TT3|) and fIBTTTj) to obtain 

2 n V 2 r ~ 



2 m / 1 



and 



\\Dfl n „ 



< 



2.2V- 1 x 
2-' ! ■■- j ( V 2" 1+n 



1 



1 



i 



if n < m, 
if n > m 



thus (I3.23P and ( I3.24p follow immediately. 



3.2 Estimating capacities 

Lemma 3.5 There is a constant C depending only on p, 7 and d swcn tnat 



W 1 



< Cg2 Vm G N, g > 1 



where j = 1, 2. 

Proof. (I3.26P follows from the following inequality 



n=l fc=l 



P9 



and Lemma 13.21 applying to the L P9 -norm. ■ 

Lemma 3.6 There is a constant C depending only on p, 7 and d such that 

,(m+l) ^M^f 



for all m G N, and j = 1,2 and q > 1. 



(3.26) 



(3.27) 



Proof, thus, together with (I3.12p . there is a constant depending only on d such that 
Therefore, by applying these estimates to L pq norms we deduce that 



|p 1 (it; {m+1) ,^ (m) ) p || (? < 



n>m k=l 

00 



(m+l),l (m),l 

j.fe— 1 .fc j.fe — 1 ifc 



< ^fWi 



n>m 



p-2 
2 



P-2 
1 > ^ ~ 

2™ 
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Similarly 



oo 2" 



71=1 fc=l 
OO 



(m+l),2 (m)J 
j.k—1 /fc tfc— l ^ 



Vn<m 

oo 



Cg2 n 7 A/ — ( — 
^ / ^ y 2 m \ 2 m 

n<m 



< 



< CgS — 



p-2 p-2 



n ' 



n>m 



P-2 



Lemma 3.7 There is a constant C depending only on p, 7 and d such that 

\\ Pl (w^r\\ qA < Cq* 

for any q > 1 and m e N. 



Proof. Let Fi = w { ™}>\. Then, by (ET511 . DY 1 = l Jk (if n < m) or |£l , fc(i 



so that 



p 1111% = pii^ii: (p -i)I^iIh 

77pll£ll(p-l)q 



p-1 



^- for n < m 



(lFll£ll(p-i)<?) forn> 



where £ ~ iV(0, Ir^). Hence 

\\\D Pl {w^y 



00 2 n 

71=1 fe=l 

■DO 



(m),l 



< 



71=1 ^ ' 



1-1 



Lemma 3.8 Taere a constant C depending only on p, 7 and d such that 



/or any m, q > 1 and j — 1,2. 
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(m+l),j _ 

f[3T24j) with 2iV = £, we may deduce that 



Proof. Let u 3 k (w) 



. According to (l3~T3]) . ( ED , ( l3~23j) and 



m,n,k 1 1 „ 



£ /2 



) m \ 2 



2 2r 



for n > m 



and 



m,n,k 1 1 o 1 



< 



C<p (^ht) 4 if n < m, 
C<P (f^ ) 5 if n > m . 



On the other hand, by triangle inequality 



Pj (w {m+1 \w^)l 



oo 2 n 



n=l fc=l 



It follows thus that 

oo / 1 



p-2 
2 



< 



P-2 



n>m 



1 \ 4 



and 



n=l 



n>m 



P-2 

4 



^ 2 (- (m+1 w m ¥[ < ^ (E- 72n (^r) 3 + E- 72 "(^ v 



where constants C may be different from line to line but only depend on p, 7 and d. ■ 
Finally we need an L 9 -estimate for the malliavin derivative of p 1 (tu^) p p 1 (iu( m ' +1 \ w^) p . 



Lemma 3.9 There is a constant C depending only on p, 7 and d such that 



{m)\p n ( w (m+l) w (rn)\p 



:m) ) p \\ ,<Cql(- 

> 1 1 0, 1 — ^ \ Or; 



P-2 



(3.30) 



for any m G N. 



21 



Proof. Firstly, by Cauchy-Schwartz's inequality 



(ttf (m) ) p Pi(tD (m+1) , w (m) ) p \\ q 
< ^ II Pi (™ (m) ) p II II Pi ( w(m+1) » wim) ) p I 



2g 



p-2 



Similarly, by chain rule, and Cauchy-Schwartz inequality 

||D { Pl {w {m) Y Pl (w {m+l \ v)W) p ) || 
< ||p 1 (t«( m )) p Dp 1 (ti/ m+1 \ ™ (m) ) p || g 
+ Hp^w^ 15 , w (m) ) p Dp 1 (ii;( m J) p || 



I2q 



+ ^\\Dp 1 (w( m ^)P\\ 2q ^\\p 1 (w( m+1 \w( m ))P\ 



2q- 



together with (13T26D . (13T28]) and (13T29]) we obtain ( ET50j) . ■ 

Recall that p e (2, 3) and 7 > ~ — 1, and C Pi7 the constant appearing in ( 12. 5p . 

Theorem 3.10 Suppose (3 G (0, i/ien 

£ ^,,1 jci P (™ (m+1) V m) )| > C Pi7 < 00 . 

Proof. By using our basic estimate ( I3.29P 

Cap ql \ Pj {w {m+1 \w {m) ) >\v\ = Cap ql \ Pj (w^ m+1 \w^)J > A 



< 



< 



pAw 



(m+l) w {m)\* 



9,1 



C { 1 N 4 



A \2 r 

where C is a constant depending only on d and p. Choose A such that 

P 



to obtain 



X>/p 



Cap qjl \ Pj (w^ +1 \w^)> 



< C 



P-2 pP 



Since 2^ - ^ > so that 



^Ca Pg ^\ Pj (w^\w^)\ > <oo. 



(3.31) 
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Similarly 



Cap ql (\ Pl {w^) Pl (w^ m+1 \w^)\ > \ 1/p 



< \ \\p 1 (w im) yp 1 (w^ 1 \w( m) ) p \\ . 

v P- 2 

c i — 



< — , 

- a V 2 m 



so that 



Cap qA { P Jw( m+1 \wW) >(!-) \<c n ' 



2 m / ~ V 2 



Now (13T3B follows from (T23]). ■ 
Corollary 3.11 Suppose p e (2,3), i/ien 



Cap g)1 <^ d P {™ (m+1 \ ™ (m) ) = 00 I = 0, Vg > 1. 



. m=l 



We have thus proved (I2.3P for N — 1. 

4 The proof of the quasi-sure convergence 

Guided by the estimates we have obtained in the previous section, we wish to show that for 
every pair q > 1 and iVeN 

for some choices of /3 > and £ > 0, where C and C" are two constants independent of m. 
Therefore we are interested in the capacity 

I,-(m) = Cap qtN {pAw^ +l \w^)i > A 



(i = 1,2). 

Since 

00 2™ 

Pj (^(™ +i ),^( m ))f = E n7 E k ( P 1),j "" (m)J 

n=l fc=l 

so that, for every # > 

00 ( 2 



+ fc ifc — 1 .fc 



^»v->)5 > a c u E h* 5 * - 7 > C » A (f) < 41 > 

n=l lfc=l V 7 I 
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where 



Therefore 



Ca 



(m+l)j _ n {m),j 
J fk-1 +k 



n=l 

oo 2 n 



. fe=i 



(m+l),j „„(m)J 

n=l fc=l 

On the other hand, for any N > we have 



> AC, 



> AC, 



3+1 



(4.2) 




(m+l),J ( m )j 

'^j.k — l .fc — 1 



> AC, — 
2™ 



w v fe , ; — w\_l , 

+ K 1 +fc -f- +k 



> xcs I - 



A^Cf 



1 \ 



2 V 




where 



fm,n,k( W ) 



(m+l),j 
W f k-1 , fc %fc-l , fc 



,( m )j 



2iV 



for w G . 



If TV is a natural number, then f 3 mnk are polynomials of the Brownian motion paths, so are 
smooth functionals on the Wiener space W in Malliavin's sense. This latter fact allows us 
to apply the capacity maximal inequality to bound the preceding capacity. Namely, for each 
pair q > 1 and N e N, we have 



Cap 



q,N 



(m+l),j 

>^ - w\ k >[ J 



> XC e ( — 

2™ 



< C 



X7CS ' ¥ 



-, -2AT 



where C depends only on d, q and N. It thus follows that 



ll/f 



m,n,k 1 1 



(4.3) 



oo 2" 



J,(m)<C^^ 



n=l fc=l 



A^Cf 



1 \ ^ 



-2N 



(4.4) 



Therefore, we need to estimate the Sobolev norm \ \f^ nn k\\q,N i n order to prove our main 
theorem 12.31 and we will see that there is a good reason (see the constraint (I4.18P below) 
that we need to raise the power of — W k-±. to 2N for large enough N. 

To this end, we first need to evaluate higher order Malliavin derivatives of fL n u- For 



simplicity, let Xj(w) 



w 



(m+l)j 



- w { ™]?. and Y^w) = w™ h (j = 1,2). Suppose N eN 



,( m )J 
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is chosen, and consider / = |X| 2Ar where X = Xj or Yj (j = 1,2). In all these cases, / 
is a polynomial of Brownian motion path, and thus is smooth in the Malliavin sense. In 
particular, / G V> q N for any q > 1 and N G N. We want to find an upper bound for the 
Sobolev norm ||/||g,jv- 
If M < N, we have 

M 

D M f = J2 C ai ... a jX\ 2 ^-^D ai \X\ 2 ®---®D a »\X\ 2 (4.5) 

/Lt=l ai+-+a li =M 
4>«i>l 

where C ai ... a are constants depending only on a's, j = 1 or 2, M and N. Therefore, by 
using Holder's inequality, we have 

N M 

H/IU < CYsY. \\\X\ 2(f! -^D a i\X\ 2 ®---®D a »\X\ 2 \\ q 

M=0 M=l QiH ha M =M 

4>«,;>0 

N M fj> 

£ C EE E iwi^nii^wn*. < 4 - 6 > 

M=0 /x=l aiH ha M =M i=l 

4>ai>0 

for some constant C depending only on N, N, where the restriction that 4 > aj > comes 
from the fact that \X\ 2 is a polynomial of Brownian motion of order at most four as X = Xj 
or Yj, so that .D a |X| 2 = for a > 5. The inequality (14. 6p . though completely elementary, 
allows us to develop the necessary estimates for the Sobolev norms we are interested. 



Lemma 4.1 Suppose X is a smooth Malliavin functional, and D a X = for a > 3, then 

\D\X\ 2 \ H < 2\X\\DX\ H , I^IX] 2 !^ < 2\DX\ 2 H + 2\X\ \D 2 X\ H m (4.7) 

and 

\D 3 \X\ 2 \ H&i <Q\D 2 X\ H m\DX\ H , \D 4 \X\ 2 \ H m <Q\D 2 X\ 2 H ® 2 . (4.8) 

Proof. These estimates follows from the chain rule directly. ■ 

Lemma 4.2 Let m,n G iV ; and k = 1, • • • , 2 n . Let Yj = w^}^ k and Xj = w^X l \^ —w ^-i k 
(j = 1,2). Then, there is a constant C depending only on d, such that for any q > 1 

IMnHI < I C rf q nk I n > m ' (4.9) 

11 llq ~ l C^q± ifn <m, v 1 

\\D a \X x \ 2 \\ < ( % n > m ' (4.10) 

5 ^ if n < m y ' 

llr^l y l 2 ll ^ i C Vt~ h f^ forn>m, , . 

llD]X2] C^f-^ fom<m ^ 



and 



where a = 1,2 and b = 1,2, 3, 4. 
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Proof. If n > m, then Y 1 = |^ {n ' m) , DY 1 = ^1 _* (B>m) and D 2 Y X = 0, so that 

* ^ 'Jm 

I^I^IV < 2|Y 1 ||2?Y 1 | H = 2-- V /-|e i ' m) | (4.12) 



which yields 

om 

IIW|| 9 <<v^ 

where C depends only on d. Similarly 



D'\Y X \'\ H ^ < 2\DYtf H = 2—l jkJn , m) 



so that 



D 2 \Y 1 \ 2 \\ <2^- 



2 2n 



If n < m, then Yi = £ n so that DY\ = ljk, hence 



where C depends only on d. This proves (14.91) . (14. 10p follows (14.91) and the fact that X\ = 
if n < m. 

Together with Lemma \A. II and the L 9 -estimate (13. 14j) for X 2 , we can conclude that there 
is a constant C depending only on d such that 



Vn < m 



\\D a \X 2 \ 2 \\ <CJq~ a ^— 

II 1 z 1 1 1 q — V 1 2 m + n 

for a = 1, 2, 3, 4 and g > 1. 

Now consider the case that n > m. In this case 



1 o2m 

y- _ ? „( m ),2 _ f f tfc(n,m) ~ efc(n,m) 



(4.13) 



so that 
where 

It follows that 



1 22m -j^ 22m 

DY2 = 2 2F^ 1 ^ n - mh ^ n ' m) ^ D2Y2 = 2 22^^ 1 ^ ( ™' m) ' 1 ^ ( "' m) ^ 



{ 1 jk(n,m) , 1 jk(n,m) J - (tj , ^2 ) — {1 .tin,™) (tl), 1 ,t(?i,m) (^)}' 



4-6' 



i2m 



2 4n 



for n > m, 6 = 1, 2, 3, 4. 



and therefore (14. lip . ■ 

In what follows we assume that iVeN and N < N. Let 



rn,n,k 



W 



(m+l),j _ (m)j 

fc-l fc w h = l k 

2 n i 2 n 2™ ' 2™ 



2iV 



> 9m,n,k( W ) 



W 



{m),j 



2N 



(4.14) 



26 



Lemma 4.3 There is a constant C depending only on N, N and d such that 

(nm \ N 

—J forn>m (4.15) 

and 

\\9 1 mnk \UN<{ C f&f f ° rn>m (4.16) 

for all q > 1 . 

Proof. If X\ = iu*™* * — w^j 1 ' 1 h (for n > m, otherwise Xi = 0). By (14. 6p 



N M n 

\\fm,n,k\\q,N < C ^ 1 1^ 1 1 Z^N-^) II I ^ l^ 1 ^ 1 1 2 ^ 

M=0 m =1 HH ha M =M i=l 

2>a l >0 

£ c££ E (^J (4 

i\/=0 /U=l oiH ha M =M V 

4>ai>0 



2 »n \ ^ 



- ^ ( ^ ] for n > m. 



The same estimate remains true in the case that Y\ = w)™^ 1 k and n > m. On the other 
hand, if n < to, then 



N M n 

<cEE £ iwi^nira* 

M=0/i=laiH ha M =M i=l 

2>a;>0 



l| I|2/i9 



< 



_ JV M / 1 V / 1 X (Ar_Al) 

^ ( 2™ ) ( 2" 



M=0 /x=l oi H h<v=M 

2>aj>0 



Cq. N ( ^ ) fo r n — m ' 



Lemma 4.4 There is a constant C depending only on N, N, d such that 
for all q > 1 . 



m,n, k \U,N<< CV t N S^l\f ) f° rn ^ m > , (1.17) 
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Aq(N - (i) and fl4TTTD 



Proof. Let X 2 = w^^ l ]f — w^{'\ . By (14.61) and the L 9 -bounds of X 2 applying to 



\iq(N-fM) 



< 



Cq 



if n > m 



and 



we obtain, for n < m, 



\D b \X 2 \ 2 \ 



< 



N M 



Cq 22 



for n > m 

2w ~ i Cq 2 ^ for n<m, 



2 4 " 



ll/m,n,fcllg,JV < C ^ ^2 ^2 



M=0 fi=l oi H ha M =A/ 

4>ai>0 



4N 



1 



2m+n 



N 



Similarly, if n > m then 



2?rt+n 



2{N-n) 



N M 



\\f 
1 1 J m,n. 



k\\q,N 



< 



C EE E «k 



M=0 M=l aiH ha M =Af 

4>a;>0 



2" 



\ 2(iV-/i) / 9 2m\ M 



! 2 2n y 



< Cy/q 



•IN 



2 2n 



2iV 



9 



2 4n 



Proposition 4.5 Choose N e N and /3 > suc/i i/iai 

P p-2- -I 

P-2-4 >0,/3 + fl< " 
iv * 



(4.18) 



Then for any N < N there is a constant C depending only on N, N, 8, (3, d and q > 1 such 
that 



. k=l 



fh-l ,k 



J >C B ' 1 



for n, m G N, k = 1, ■ ■ • , 2 n , where 

'p-2 



(0 + P) 



2jN 
p 



< C 



1 



2max{m,n} 



(4.19) 



-1, J = 1,2. 



(4.20) 



28 



Proof. For each fixed k = 1, • • • , 2 n , consider f 3 mn k (w) 
is continuous on W. Thus, according to the capacity maximal inequality 



(m+l),j 

t-k — 1 4-k j.k — 1 .fc 

Tl '71 1 



fc-i , 



2iV 



CaP q ,N { fL,n,k > 



(m+1) ,j „„(m),j 



V" 1 *fe 



1 

2 m 



1 \ 



1 

2" 

2JV 



< C 
On the other hand 

Ca Pq,N { Yl 



I \ p 



1 \ 



-, -2AT 



llf j 

1 1 J m,n. 



k\\q,N- 



Xm+l),j 



. fc=l 



E / 1 



< 



q,N 



(m+l),j n ,A m )'3 



>C 9 



>Co 



k=l 



1 

2 m 



1 

2" 



9+1 



It follows from f l4TT|) that 



k=l 

I 2'™/ 1 2™/ 



>C 



< < 



< c 



C2 n 
1 



-27V 



\2 n ) \2 m ) 



\2 m ) \2 n J 



-2JV 



( 2 m \ 
\2 2n ) 



2N 



for n < m, 



for n > m 



p-2 



2/V-AiV(/3+0)-l 



for all n and m. 



Similarly, for j = 1 and n > m we have 

2 n 



< CT 



< CT 



< C 



. k=l 

or 



(m),l 
W f k-1 fk 



1 \ 



2" 



-2N 



l\\q,N 



ffl 2^ 

£_2iV-|7V(e+/3)-l 



1 \ p 



1(0+1) 



-, -2AT 



2 m 
2 1 ™ 



which completes the proof. 



which 
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Proposition 4.6 Let p G (2, 3), g > 1, N G AT, and /3 G (0, ^). T/ien, /or any e > i/jere 
is a constant C depending only on p, d, q, N and (3 such that 



Cap qjN t Pj {w^ m+1 \ w^) > ( — ] ) < ( — 



J \ V 



1 

2 m 



(4.21) 



for all in GN and j = 1, 2. 

Proof. Choose 6 > and N G N such that N > N, 

p-2-l>0,P + 6< P -^-Jl 
y N 2 2N 



and 



p-2 



(0 + 0) 



2iV 
p 



- 1 > 2e 



Then according to Proposition 14. 5[ there is a constant C depending only on N, N, 8, f3, d and 
q such that 



Cap^{pJw^ +1 \w^)l > (-- 



n=l 



< 



n=l 



k=l 
2e 



^ ( Om ) ^ ( On 



n>m 



2e 



< c 



This proposition shows the capacity of {pj(w^ m+1 \ w^) > 2~- 7/3m//p } for small (3 > 
decays sub-exponentially in 2~ m (in contrast with the decay rate in (13. 29 j) which is indeed 
not a sharp estimate). This is the right order for j = 2. In the case j = 1 and for capacity 
Cap2,i, this result was established by M. Fukushima [T6] . 

Lemma 4.7 Let p G (2, 3), q > 1, N G N, 5 > and N G N such that 



N < N, AM1--J-1> 0; 

there is a constant C depending only on N, p, d, 8, q > 1 such that 

/ 1 \ ~^ 

Ca Pq , N { Pl (w {m) ) > {2 m )"} < C (—) " Vm G N. 



(4.22) 
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Proof. Choose 9 > such that 



\ ( 1 - 2^±I | - I > 0. 

V 



Then 

Ca PqtN { Pl (w {m) y > (2 m ) S 



k=l 



71=1 

oo 2 n 



(m),l 
Fk-i fc 



QT1 ) QT1 



< 



g,JV _fe 



n=l k=l 

oo 2" 



(m),l 

c-l 

2" ' 2" 



> C e (2 r 



2' 



< EE c °p 



71=1 fc=l 



j / 1 \ f 



-, 27V ' 



2" 



where 



9m,n,k 



(m),l 



2N 



Thus, by using the capacity maximal inequality and (14. 16ft : 



\\9m,n,k\\q,N < 

we obtain 

Ca PqtN { Pl (w^Y > (2 m ) 5 } 



Cq" (£) 



iV 



for n > in, 
for n < m, 



< 



n=l 



Cj (2 r 



j / 1 \ p 



9+1 



-27V 



N 



9+1 
• / 1 \ P 



-27V 



2 m 
22" 



/V 



< c 



. TV— m / -, \ iVfl-22±l)-l 



2" 



71=1 



2" 



n>m 



\ 9"* / ' J \ 2 n \2 n 



TV 1-22+i 
V p 



< C 



2" 



TV- 



Proposition 4.8 Le^ p G (2, 3), q > I, N G N, and /3 G (0, ^). Taen, /or any e > there 
is a constant C depending only on p, d, q, N and (3 such that 



Cap qtN {p l {w^) Pl {w^ +1 \w^)> (±Y\<c(± 



(4.23) 



for all m G N. 
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Proof. Choose N eN, 8 > 0,5 > such that 



+ l\ 
N{1- 2 J - 1 > £, 

V ) 

p-2-l>0,P + 6 + 5< P -^-^ 
y N 2 2N 



and 



Then, since 



p-2 



+ (3 + 6) 



2N 
P 



1 > 2e . 



c {p 1 (^r>(2 m ) 5 }u|p 1 ( 



w {m+l) w (m))P > 



13+5' 



so that 



< Cap 9i „ {pi(^ (m) ) p > (2 m ) 5 } + Cap 9>JV jp^^ 1 ) V m) ) p > 



/3+<5' 



< c 



Putting ( f)2.1ip . ( 14.2 ip and ( I4.23[) together we may conclude the following 

Theorem 4.9 Let p e (2,3). T/ien /or any e > 0, q > 1 and iV 6 N, t/iere are (3 > 0, 
constants C\ > and C2 > depending only on p, q, N , d and e such that 

Cap q>N L p (w^ +1 \w^) > Ct (^y\ < C2 Vm E N. (4.24) 

We are in a position to prove the main theorem 12.31 By the capacity version of the 
Borel-Cantelli lemma, ( I4.24p implies that 



A = I w E W : d P 



00 



m=l 



is slim, that is, Cap q ^ {A} = for any q > 1 and iV 6 N, so that 

{w eW : (w im) ) is not Cauchy in G p n{R d )} 
is slim, and therefore — > w in G p ri(J$, d ) quasi-surely. 
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